Abstract

We compared the number of faint stars detected in deep survey fields with the 
current stellar distribution model of the Galaxy and found that the detected number 
in the H band is significantly smaller than the predicted number. This indicates that 
M-dwarfs, the major component, are fewer in the halo and the thick disk. We used 
archived data of several surveys in both the north and south fields of GOODS (Great 
Observatories Origins Deep Survey): MODS in GOODS-N, and ERS and CANDELS 
in GOODS-S. The number density of M-dwarfs in the halo has to be 20 ± 13% relative 
to that in the solar vicinity, in order for the detected number of stars fainter than 
20.5 mag in the H band to match the value predicted by the model. In 
the thick disk, the number density of M-dwarfs must be reduced (to 52 ± 13%) or the 
scale height must be decreased (to ~ 600 pc). Alternatively, the overall fractions of the 
halo and thick disk can be significantly reduced to achieve the same effect, because 
our sample mainly consists of faint M-dwarfs. Our results imply that the M-dwarf 
population in regions distant from the Galactic plane is significantly smaller than 
previously thought. Using the best-fit number densities, we then discuss how well 
the model predicts the number of non-companion faint stars in direct 
imaging extrasolar planet surveys. 

Key words: stars: imaging — stars: low-mass — Galaxy: halo — Galaxy: disk 


1. Introduction 

Direct imaging surveys of extrasolar planets are conducted by using large telescopes 
(e.g. Bowler et al. 2013; Chauvin et al. 2014; Nielsen et al. 2013; Yamamoto et al. 2013). Many 
planet candidates, whose magnitudes reach around 22 mag in the H band (1.6 μm) and K band 
(2.2 μm), were detected. However, only about 10 extrasolar planets have been confirmed¹. In 
other words, most of the detected candidates are stars that are not companions but merely lie along 
the same line of sight, despite the primary goal of detecting extrasolar planets. It is therefore 
important to estimate the number of unbound stars in a given field of view (FoV). 

The mean number of such stars can be estimated using a stellar distribution model of the 
Galaxy. The standard stellar distribution model was proposed based on optical observations 
(e.g. Bahcall & Soneira 1980; Gilmore & Reid 1983). Jurić et al. (2008) modified parameters 
of this model, such as the scale height and length, using the SDSS (Sloan Digital Sky Survey) 
up to 22 AB mag in the r band (6231 Å), in order to match deeper optical star counts. 
The same distribution profiles of the aforementioned model are applicable to the infrared star 
counts (e.g. Jones et al. 1981; Wainscoat et al. 1992). Recently, some studies were conducted to 
optimize parameters of this model to match with the space telescope catalogues up to around 
15 mag in the near-infrared (NIR) band (e.g. Robin et al. 2003; Chang et al. 2012; Polido et al. 
2013; Czekaj et al. 2014). Nakajima et al. (2000) conducted deep imaging observations directed 
toward the north Galactic pole in the J band using the Subaru Telescope, and their findings showed 
that observations were consistent with the model’s predictions, including M-dwarfs and brown 
dwarfs (L- and T-type stars). However, they could not quantitatively discuss the number 
densities of these stars due to the small sample size. 

The stellar distribution model of the Galaxy was evaluated only up to 15 mag in the NIR 
band, although detected stars are generally fainter for extrasolar planet surveys. There is some 
doubt as to whether this model can be used to estimate the number of such faint stars. It is 
highly probable that these stars are distant late-type dwarfs. Actually, it is difficult to establish 
the number densities of these dwarfs outside the solar vicinity. Several studies have focused 
only on the M-dwarf distribution (e.g. Pirzkal et al. 2009; Ryan et al. 2011; Holwerda et al. 
2014). It is, however, too early for a clear consensus on how M-dwarfs are distributed to have 
emerged. This study seeks to constrain the number densities of distant M-dwarfs using the star 
counts in deep NIR imaging. Furthermore, we apply the results to the extrasolar planet survey 
data in order to estimate the number of field stars (i.e. non-companions) in a survey field. 

¹ Extrasolar Planets Encyclopaedia, http://exoplanet.eu/ (as of Feb. 2014) 

In Section 2, we explain the employed stellar distribution model of the Galaxy. In 
Section 3, we describe the observational data used for model optimization. Section 4 contains a 
comparison of the observations with the results predicted by the stellar distribution model and 
a discussion on how we modified the model’s parameters to agree with the observational data. 
Then, this model is applied to the extrasolar planet survey data in Section 5. 


2. Model 

2.1. Stellar Distribution Model of the Galaxy 


We employed the standard stellar distribution model described as follows. There are 
three components in the model: the thin disk ($D_1$), thick disk ($D_2$), and halo ($H$). In this 
study, we did not include a bulge component because no observations toward the Galactic 
center were used (see also Section 3). Using the Galactocentric coordinate system, the stellar 
number density ($n_i$) at position $(R, Z)$ is written as the sum of three components, where $i$ and 
$n_{\odot,i}$ represent the spectral type and the corresponding local space density (LSD). 

$$ n_i(R, Z) = n_{\odot,i} \left[ D_1(R, Z) + D_2(R, Z) + H(R, Z) \right] \qquad (1) $$


An exponential profile and a spheroidal profile are used to describe the distributions of the disks 
($D_1$ and $D_2$) and the halo, respectively. The distributions of the number density (Equations 2 to 4) 
follow e.g. Chang et al. (2011), where $h_R$ and $h_Z$ represent the scale length and scale height. 
Subscripts 1 and 2 denote the thin 
disk and thick disk, respectively, $f_d$ and $f_h$ represent the fractions of stars near the Sun that 
belong to the thick disk and halo components, $\kappa$ is the flattening parameter, and $p$ is the power 
law index of the halo. The position of the Sun is given by $(R_\odot, Z_\odot)$. 
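
Equations (2) to (4) are not reproduced in this text. As a hedged reconstruction only, assuming the usual exponential-disk and flattened power-law spheroid forms implied by the parameter definitions, and assuming that the thin disk carries the remaining local fraction $1 - f_d - f_h$ (since the fractions are defined relative to the total local density), the three components could be written as:

```latex
\begin{align}
D_1(R, Z) &= (1 - f_d - f_h)\,
  \exp\left(-\frac{R - R_\odot}{h_{R,1}} - \frac{|Z| - |Z_\odot|}{h_{Z,1}}\right), \\
D_2(R, Z) &= f_d\,
  \exp\left(-\frac{R - R_\odot}{h_{R,2}} - \frac{|Z| - |Z_\odot|}{h_{Z,2}}\right), \\
H(R, Z) &= f_h
  \left[\frac{R^2 + (Z/\kappa)^2}{R_\odot^2 + (Z_\odot/\kappa)^2}\right]^{-p/2}.
\end{align}
```

With this normalization the three terms sum to unity at the solar position, so that $n_i(R_\odot, Z_\odot) = n_{\odot,i}$. The exact expressions and normalization should be checked against the cited reference.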

The number of stars ($N_i$) is calculated for each sub-spectral type using the above model. 
We define $a$ as the distance from the Sun to position $(R, Z)$. Introducing the Galactic 
coordinates of the observed field $(l, b)$, the position can be written as follows: 

$$ R = \left[ R_\odot^2 + a^2 \cos^2 b - 2 R_\odot a \cos b \cos l \right]^{1/2} \qquad (5) $$

$$ Z = a \sin b + Z_\odot \qquad (6) $$

Table 1. Parameters of the stellar distribution model. 

Component             Symbol        Value
Position of the Sun   $R_\odot$     8000 pc
                      $Z_\odot$     25 pc
Thin Disk             $h_{R,1}$     3700 pc
                      $h_{Z,1}$     260 pc
Thick Disk            $h_{R,2}$     5000 pc
                      $h_{Z,2}$     1040 pc
                      $f_d$         0.091
Halo                  $\kappa$      0.55
                      $p$           2.6
                      $f_h$         0.002


We estimated the number of stars whose apparent magnitude is between $m_1$ and $m_2$. For each 
spectral type, the apparent magnitudes ($m_1$ and $m_2$) correspond to distances ($a_1$ and $a_2$). The 
number of these stars in a field ($S$) is calculated by 

$$ N_i = S \times \int_{a_1}^{a_2} n_i(a)\, a^2\, da. \qquad (7) $$

Note that this integral only depends on the distance $a$. The total number of predicted stars 
($N$) is calculated by summation over the number of stars of all spectral types: 

$$ N = \sum_i N_i. \qquad (8) $$
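
The chain from Equations (5) and (6) to Equation (7) can be sketched numerically. The following is a minimal illustration, not the authors' code: the density profiles and the thin-disk normalization are assumed forms (the explicit profiles are not reproduced in this text), the magnitude-to-distance conversion is the standard distance modulus without extinction, and all function names and the `steps` parameter are hypothetical.

```python
import math

# Table 1 parameters (lengths in pc). The functional forms used below are
# an assumed reconstruction, not copied from the source.
R_SUN, Z_SUN = 8000.0, 25.0
H_R1, H_Z1 = 3700.0, 260.0
H_R2, H_Z2 = 5000.0, 1040.0
F_D, F_H = 0.091, 0.002
KAPPA, P = 0.55, 2.6


def galactocentric(a, l_deg, b_deg):
    """Equations (5) and (6): heliocentric distance a (pc) -> (R, Z)."""
    l, b = math.radians(l_deg), math.radians(b_deg)
    r2 = (R_SUN**2 + (a * math.cos(b))**2
          - 2.0 * R_SUN * a * math.cos(b) * math.cos(l))
    return math.sqrt(max(r2, 0.0)), a * math.sin(b) + Z_SUN


def relative_density(R, Z):
    """Assumed D1 + D2 + H, normalized to 1 at the solar position."""
    thin = (1.0 - F_D - F_H) * math.exp(
        -(R - R_SUN) / H_R1 - (abs(Z) - abs(Z_SUN)) / H_Z1)
    thick = F_D * math.exp(
        -(R - R_SUN) / H_R2 - (abs(Z) - abs(Z_SUN)) / H_Z2)
    halo = F_H * ((R**2 + (Z / KAPPA)**2)
                  / (R_SUN**2 + (Z_SUN / KAPPA)**2)) ** (-P / 2.0)
    return thin + thick + halo


def distance_pc(m, M):
    """Distance modulus without extinction: m - M = 5 log10(a / 10 pc)."""
    return 10.0 ** ((m - M + 5.0) / 5.0)


def star_count(lsd, M_abs, m1, m2, l_deg, b_deg, area_sr, steps=2000):
    """Equation (7) for one spectral type, using the trapezoidal rule.

    lsd is the local space density in pc^-3 and area_sr is the field
    size S in steradians.
    """
    a1, a2 = distance_pc(m1, M_abs), distance_pc(m2, M_abs)
    h = (a2 - a1) / steps
    total = 0.0
    for k in range(steps + 1):
        a = a1 + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        density = lsd * relative_density(*galactocentric(a, l_deg, b_deg))
        total += weight * density * a * a
    return area_sr * total * h
```

Equation (8) is then a plain sum of `star_count` over all spectral types. As an illustration of the magnitude-to-distance step, `distance_pc(20.5, 10.5)` gives 1000 pc, where the absolute magnitude of 10.5 is chosen purely for illustration.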


2.2. Parameters of the Stellar Distribution Model 

Many studies were conducted to determine the parameters of the stellar distribution 
model, including its scale heights, fractions, and LSDs. Chang et al. (2012) estimated 
parameters based on the 2MASS catalogue. We believe these parameters to be the most reliable 
among those currently available. The fractions of the respective components ($f_d$ and $f_h$) are ratios 
relative to the local stellar density. However, because the corresponding values in Chang et al. (2012) 
are given relative to the thin disk density, we needed to convert them; 
the employed parameters are shown in Table 1. All stars are assumed to follow the same 
distribution, regardless of their spectral types. The LSDs are provided in the next section. 

2.2.1. Local Space Density (LSD) 

Our data only include targets fainter than 16 mag (see also Section 3). The focus of 
our discussion is on the region of H > 20 mag. As Figure 1 shows, stars in our sample with 
magnitudes fainter than 20 mag are mainly M-dwarfs and brown dwarfs. We therefore confine 
the spectral type of our sample stars to the range from K2 to T7. The local number density 
of G-dwarfs is ~ 27% of that of K-dwarfs (calculated from values in Kirkpatrick et 
al. 2012). Taking the distance and stellar distribution functions into account, the number of 
G-dwarfs in the 16.5-17.5 mag bin is estimated to be ~ 35% of that of K-dwarfs, which means 
that G-dwarfs occupy ~ 3.5% of that bin. F-dwarfs (and earlier type stars) constitute an even 
smaller fraction. The number of giants is also negligible because giants need to be distant to 
become faint enough to enter the observed magnitude range (out of range in Figure 1). Such 
giants are very rare. 

Fig. 1. Relationship between the distance from the Sun and the apparent magnitude. Each line shows a 
representative spectral type in this magnitude range. The absolute magnitude of a Y0-type star was taken 
from Kirkpatrick et al. (2012). Each value in parentheses represents the absolute magnitude in the H 
band. Giants are out of range. It is evident from this figure that contamination by giant stars is negligible 
in the observed magnitude range. 

We referred to five studies, described below, to establish the LSD for each sub-spectral 
type. The number of local stars published in these studies is shown in Figure 2. Two cases 
were adopted; Case 1 is the standard LSD, and Case 2 is the minimum LSD. These cases are 
shown in Figure 3. 

Fig. 2. The number of stars taken from previous studies. The number is uniformly scaled to (20 pc)^3. The 
dashed line and the dotted line represent Case 1 (standard LSD) and Case 2 (minimum LSD), respectively. 
In the range from L0 to L7, we use recent observational results, instead of the values predicted by the 
model. 

Fig. 3. Local space density of each spectral type. The solid line and the dashed line represent Case 1 
(standard LSD) and Case 2 (minimum LSD), respectively. Dotted lines show the optimized LSD, which 
is discussed in Section 4. 

(i) Reid et al. (2004): We calculated the LSD from K2 to M6 using this study’s luminosity 
function in the J band. Reid et al. (2004) mentioned that J-band magnitudes are good 
indicators of the spectral types. The number of each spectral type was estimated using 
this relation. The spectral type bins used by Reid et al. (2004) were in some cases broader 
than the width used in this and other works (e.g. M1 and M2 form one bin). We split them into 
sub-bins in order to achieve equidistant spacing between all spectral bins. The numbers 
were allocated to the sub-bins in the ratio of 1:2 in favor of fainter stars in cases where the exact 
ratio was unknown. One consequence of this approach is that the LSD did not connect 
smoothly along the boundaries between M0-M1 and M2-M3. However, this assumption 
had very little effect on our results, especially in the case of K-dwarfs. 

(ii) Cruz et al. (2003): This survey examined the number of stars from M7 to L8. Few 
stars with spectral types later than L0 were included in their analysis because the survey 
was incomplete at such faint magnitudes. We focused on M7-M9 dwarfs, for which we 
calculated the LSD considering the survey’s completeness and sky coverage. 

(iii) Reid et al. (2008): The number of K through L7 stars was examined in this paper. The 
focus was mainly on types ranging between M9 and L7, while the other sub-spectral types 
were adopted from the previous work (Reid et al. 2004). The LSD was calculated using 
the procedure in (ii). 

(iv) Burgasser (2007): Stars later than L8 were rarely detected because of their faintness. 
Theoretical values were therefore used. Burgasser (2007) made theoretical model 
calculations of the binary fraction of L- and T-type stars and thereby estimated their number 
densities. We used the number densities for a primary star with a mass function index of 
−0.5, a mass ratio index of 4, and a binary fraction of 0.1. 

(v) Caballero et al. (2008): Caballero et al. (2008) estimated the contamination by field late 
M-dwarfs and brown dwarfs (L- and T-type stars) in deep surveys. For this they calculated 
the LSD of M3-T7 quoted from (i), (ii), (iii), (iv), and Kirkpatrick et al. (1994). Their 
LSD was compatible with our standard LSD (Case 1). 
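
The 1:2 sub-bin allocation described in item (i) can be sketched as follows. This is an illustrative helper only; the function name, its signature, and the example count are hypothetical and not taken from the source.

```python
def split_bin(count, weights=(1, 2)):
    """Split one broad spectral-type bin into sub-bins in the given ratio.

    The first weight applies to the earlier (brighter) sub-type, so the
    default 1:2 favors the fainter sub-type, as described in item (i).
    """
    total = sum(weights)
    return [count * w / total for w in weights]
```

For a hypothetical broad bin containing 30 stars, `split_bin(30.0)` assigns 10 stars to the earlier sub-type and 20 to the later one.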

Our LSDs are consistent with those of other recent studies investigating star counts of faint 
stars (e.g. Ryan et al. 2011; Pirzkal et al. 2009), which also employed the above-mentioned works. 
The derived LSD also matches that of Kirkpatrick et al. (2011) for stars later than T6. On the 
other hand, the LSDs later than L5 established in Reylé et al. (2010) are slightly smaller than 
ours. We will discuss how this affects our results in Section 4. Holwerda et al. (2014) suggested 
much larger central densities for M-dwarfs. However, as these densities are in conflict with the 
majority of other works, we only note them for completeness. 

2.3. Relationship between Spectral Type and Absolute Magnitude 

The typical magnitudes of each spectral type were quoted from the list² provided by E. 
Mamajek as of June 18, 2013. This list compiles information about stellar characteristics, such 
as colors, magnitudes, and effective temperatures, on the basis of spectral standards and other 
studies (see also Pecaut et al. 2012 and Pecaut & Mamajek 2013 for the stellar information). 
Figure 4 shows the relationship between spectral type and absolute magnitude in the H, $K_s$, 
and J bands. We focus our discussion on the H-band results. The $K_s$- and J-band magnitudes 
are used only to confirm the H-band results (see Section 4.1 and Section 4.2). 

Fig. 4. Relationship between spectral types and absolute magnitudes in the H, $K_s$, and J bands, taken 
from Mamajek’s list (see URL of footnote 2). 

3. Data 

The magnitude of planet candidates is generally H < 22 mag. In order to evaluate the 
aforementioned model for such faint star counts, we used archived data from deep and wide 
H-band imaging surveys: the GOODS (Great Observatories Origins Deep Survey) North and 
South fields. In this section, we explain how the data was gathered and analyzed. 

3.1. The GOODS-North Field 

We referred to the MODS (MOIRCS Deep Survey) region of the GOODS-North field 
(l = 125.9°, b = +54.8°), observed by Kajisawa et al. (2011) in the J, H, and $K_s$ bands with 
Subaru’s Multi-Object Infrared Camera and Spectrograph (MOIRCS: Ichikawa et al. 2006; 
Suzuki et al. 2008). The FoV of MOIRCS is 4′ × 7′, and four pointings covered an area of ~ 105 
square arcminutes in the survey. Only the detected number in the H band was compared with 
the predicted number. Stars were selected from this catalogue based on the method used in 
Kajisawa et al. (2011). This catalogue was released on the MODS homepage³. 

³ http://www.astr.tohoku.ac.jp/MODS/wiki/ 

Kajisawa et al. (2011) performed photometry using different aperture sizes depending 
on whether they measured colors (aperture magnitudes) or the total flux (total magnitudes). 
We only used H-band results. However, the total magnitudes were given only in the $K_s$ band, 
and only the aperture magnitudes were available for the H band. We therefore performed the 
aperture correction for the H-band magnitudes using the same difference between the total and 
aperture magnitudes as in the $K_s$ band. The distinction between stars and galaxies was made 
based on the spectroscopic work by Wirth et al. (2004). It is difficult to distinguish between 
L- and T-type brown dwarfs and galaxies by spectroscopy, but the result should remain largely 
unaffected because the total number of such stars detected in this survey was expected to be 
small. We only used stars brighter than 21.5 mag because colors and spectroscopic observations 
were not sufficient for fainter targets. We thus assumed all objects brighter than 20.5 mag to 
be stars, although some contamination (e.g. external galaxies) is to be expected in the 20.5 
to 21.5 mag bin. Figure 5(a) shows the comparison between the detected number and the 
predicted number per square degree. The uncertainty is given as 1σ Poisson scatter. 
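
The two bookkeeping steps in this subsection, the H-band aperture correction borrowed from the $K_s$ band and the conversion of a raw count to a surface density with a 1σ Poisson error, can be sketched as below. This is a minimal illustration under stated assumptions: the function names are hypothetical, and the Poisson error is taken as the square root of the count.

```python
import math

ARCMIN2_PER_DEG2 = 3600.0


def total_h_magnitude(h_aper, ks_aper, ks_total):
    """Apply the Ks-band (total - aperture) offset to the H-band aperture magnitude."""
    return h_aper + (ks_total - ks_aper)


def counts_per_deg2(n_stars, area_arcmin2):
    """Return (surface density, 1-sigma Poisson error), both per square degree."""
    scale = ARCMIN2_PER_DEG2 / area_arcmin2
    return n_stars * scale, math.sqrt(n_stars) * scale
```

For a hypothetical magnitude bin containing 9 stars in the ~ 105 square arcminute MODS area, `counts_per_deg2(9, 105.0)` scales both the count and its square-root error by 3600/105.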

3.2. The GOODS-South Field 

We used two regions of the GOODS-South field (l = 223.6°, b = −54.4°): the ERS (Early 
Release Science) region (Windhorst et al. 2011) and the CANDELS (Cosmic Assembly 
Near-infrared Deep Extragalactic Legacy Survey) region (Guo et al. 2013). The ERS region was 
observed by Windhorst et al. (2011) using the Wide Field Camera 3 aboard the Hubble Space 
Telescope. Only data taken in the F160W filter (corresponding to the H band) were used. 
The total sky coverage was 9′.75 × 4′.5. Stars had already been distinguished from galaxies 
by Windhorst et al. (2011). The CANDELS region was observed using the same instrument, 
achieving a total area of ~ 165 square arcminutes. We extracted stars from the point source 
catalogue created by Guo et al. (2013) based on the value of CLASS_STAR. Objects whose 
CLASS_STAR was larger than 0.8 were regarded as stars. We only used the photometric 
results of the F160W filter data (FLUX_AUTO). We converted the flux in the AB magnitude 
system to the Vega magnitude system (adopting a correction value of 1.5 mag only for the 
ERS region). The final detection limit was approximately 24.5 mag in the ERS region, and 
23.5 mag in the CANDELS region. Therefore, we assumed that all objects in the ERS region, 
and all objects in the CANDELS region except those in the 22.5-23.5 mag bin, are stars, and 
expected that the 22.5-23.5 mag bin in the CANDELS region includes some external galaxies. 
Figure 5(a) shows the comparison between the detected number and the predicted number per 
square degree. While the error bars of the predicted number represent 1σ Poisson scatter, the 
uncertainties in the detected number of the ERS region are taken from Windhorst et al. (2011). 
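
The star selection and magnitude conversion for the CANDELS catalogue can be sketched as follows. This is an illustration under explicit assumptions, not the authors' pipeline: the catalogue is represented as a list of dictionaries (a hypothetical layout), FLUX_AUTO is assumed to be a flux density in microjansky, and the AB-to-Vega offset is left as a required argument because the text quotes a value for the ERS region only.

```python
import math


def ab_magnitude_from_ujy(flux_ujy):
    """AB magnitude of a flux density in microjansky (AB zero point 23.9)."""
    return 23.9 - 2.5 * math.log10(flux_ujy)


def select_stars(catalog, ab_minus_vega, class_star_min=0.8):
    """Return Vega magnitudes of entries whose CLASS_STAR exceeds the cut.

    catalog is a list of dicts with keys "CLASS_STAR" and "FLUX_AUTO"
    (hypothetical layout); ab_minus_vega is subtracted from the AB magnitude.
    """
    mags = []
    for row in catalog:
        if row["CLASS_STAR"] > class_star_min and row["FLUX_AUTO"] > 0.0:
            mags.append(ab_magnitude_from_ujy(row["FLUX_AUTO"]) - ab_minus_vega)
    return mags
```

The resulting magnitudes can then be binned in 1 mag steps (16.5-17.5, 17.5-18.5, and so on) for comparison with the predicted counts.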

4. Results and Discussion 

The results of GOODS-North (MODS region) and GOODS-South (ERS and CANDELS 
regions) are shown in the left and right side of Figure 5, respectively. In Figure 5(a), the 
detected numbers shown as bar graphs were compared with predicted numbers shown as a 
solid line (Case 1: standard case) and dotted line (Case 2: minimum case). Figure 5(b) 
shows fractions of Galactic components (the thin disk, thick disk, and halo) of M-dwarfs and 
other spectral types relative to all predicted numbers. There is a slight difference between the 
predicted numbers for Case 1 and Case 2 in Figure 5(a), but the features for these two cases 
resembled each other when compared with observations. We therefore only discuss the results 
with respect to the predictions of Case 1, while the same conclusion holds for Case 2. It was 
not necessary to correct for interstellar extinction because observations were conducted in the 
H band and the Galactic pole data were used. 

We present the comparison for each field in Subsections 4.1 and 4.2, and then 
consider how to optimize the stellar distribution model to match these deep NIR imaging 
data in Subsection 4.3. 

4.1. Results of the GOODS-North Field 

The detected number was smaller than the predicted number in all bins in the left panel 
of Figure 5(a). We found from Figure 5(b) that almost all predicted stars are members of the 
thick disk or the halo, and that these stars are mainly M-dwarfs. This shows that the number 
of M-dwarfs in the thick disk and halo component is small compared to the model predictions. 
This result indicates that both the thick disk and halo have fewer M-dwarfs compared to the 
population in the solar vicinity. We also checked the $K_s$-band star count using the LSD shown in 
Figure 3 and the relationship shown in Figure 4. The left side of Figure 6 shows the comparison 
between the detection and prediction. The same result holds if MODS data from the $K_s$ band 
are used. 

Stars other than M-dwarfs (e.g. L- and T-type stars) were rarely detected in the range of 
16.5 to 21.5 mag (see Figure 5(b)). Moreover, contamination from external galaxies is negligible 
because Wirth et al. (2004) distinguished stars from galaxies based on their spectra. While the 
possibility of contamination by faint galaxies cannot be completely ruled out at magnitudes 
between 20.5 and 21.5 mag, our conclusion is conservative: if some external galaxies were 
mistakenly counted as stars, the true number of stars would be smaller still and the difference 
between the detected and predicted numbers even larger, further enhancing the problem of 
missing faint stars in the thick disk and halo. 

4.2. Results of the GOODS-South Field 

The detected number was consistent with the predicted number for magnitudes brighter 
than 17.5 mag, but much smaller for fainter magnitudes in both the ERS and CANDELS regions 
(Figure 5(a)). Almost all predicted stars belong to the thick disk or halo, and most of these 
stars are M-dwarfs according to the model (see Figure 5(b)). This finding implies that fewer 
M-dwarfs exist in the thick disk and the halo than predicted using the standard 
LSD. This tendency is corroborated by observations of the GOODS-South field using the 
F125W filter (corresponding to the J band), as can be seen in the right panel of Figure 6. The 
finding is thus independent of the observational band. 

The catalogue by Windhorst et al. (2011) distinguished between stars and galaxies 
reliably. All detected objects shown in the ERS region of Figure 5(a) are therefore regarded as 
stars. In the CANDELS region, stars and galaxies were identified based on the CLASS_STAR 
values. It is generally difficult to avoid contamination by galaxies completely for faint 
objects when this method is employed. However, our conclusion as mentioned above does not 
change because the discrepancy between the detected and predicted stellar counts would only 
be enhanced, even in the case of a small fraction of extragalactic contamination. There was a 
discrepancy between the ERS and CANDELS star counts. Focusing on the 22.5-23.5 mag bin, 
the star count of CANDELS was larger than that of ERS. We believe this to be due to the 
inclusion of some galaxies in the CANDELS star counts. All other bins were consistent within 
the ERS uncertainties. 

Fig. 5. (a) Comparison between the number of detected and predicted stars. The bar graph shows the 
detected number. The solid and dashed lines show the prediction from Case 1 and Case 2, respectively. 
Symbols of Case 1 and Case 2 hide each error bar. (b) Fraction of Galactic components (thin disk, thick 
disk, and halo) relative to all predicted numbers. The different types of gray/shaded regions show the 
fraction of M-dwarfs in the Galactic components, whereas the unshaded regions contain all the other 
spectral types. 

Fig. 6. Comparison between the number of detected and predicted stars. The left panel (GOODS-North, 
$K_s$ magnitude) shows the $K_s$-band result of the MODS region and the right panel (GOODS-South, 
J magnitude) shows the J-band result of the ERS region. Legends 
and the field size are the same as in Figure 5(a). 

4.3. Discussion 

The detected number of Galactic stars is significantly smaller than the number predicted 
by the standard stellar distribution model of the Galaxy. Most of the stars belong to the thick 
disk or halo components and more than 80% of these stars are M-dwarfs according to the model. 
The detected stellar count and the predicted numbers from a standard parameterization of the 
Galaxy begin to differ significantly at H ~ 20.5 mag, a depth previously unachieved by surveys 
traditionally used for constraining the Galactic parameters (SDSS, 2MASS). Detected M-dwarfs 
fainter than 20.5 mag need to be more distant than ~ 1 kpc from the Sun. Our results imply 
that there are significantly fewer M-dwarfs at 1 kpc or more from the Galactic plane than 
previously thought. The number density of M-dwarfs in the thick disk or the scale height of 
the thick disk has to be decreased to match the model’s results with the detected number in 
both GOODS fields. In addition to this, the number density of M-dwarfs in the halo has to 
be decreased. Alternatively, employing significantly smaller fractions of the halo (fh) and thick 
disk ($f_d$) can also explain the scarcity of M-dwarfs observed. As the deep NIR data we used 
are predicted to contain mostly distant M-dwarfs (see Figure 5(b)), both choices have similar 
effects in our model. Hayden et al. (2013) showed, for stars up to 3 kpc, that the stellar 
metallicity decreases with increasing distance from the Galactic mid-plane, which is explained 
by the scenario of the halo and thick disk originating from dwarf galaxies that merged with the 
Galaxy (e.g. Searle & Zinn 1978; Abadi et al. 2003; Kirby et al. 2008). It is theoretically 
predicted that the cooling rate is small in metal-poor clouds and that high-mass stars tend to 
form more readily than low-mass stars under this condition. Stars belonging to the halo may have formed from 
metal-poor clouds. With respect to our results, this indicates that lower metallicity in the halo 
and thick disk components might be correlated with an M-dwarf deficit. 

We used three methods to improve upon the stellar distribution model’s parameters. 
Firstly, we changed the number densities of M-dwarfs (M0-M9) in the halo and thick disk, in 
order to match them with the detected number. Only the ERS region of the GOODS-South 
data was used because of its small uncertainty and large sample size. The best-fit case (shown 
in Figure 3 as dotted lines) was achieved with number densities of 20 ± 13% and 52 ± 13% for 
the halo and the thick disk, respectively, relative to the standard parameters (Case 1). We 
re-calculated the predicted numbers using the best-fit case, and show the comparison between 
the new predictions and the detected numbers in Figure 7. It is evident that the best-fit case 
in Figure 7 improves on the standard parameters shown in Figure 5(a). In Figure 7, a small 
disagreement can be seen in some bins, but this discrepancy falls within the 3σ uncertainty, 
considering Poisson noise of predictions and observational uncertainties. As mentioned in 
Subsection 2.2.1, our LSDs for stars later than L5 are slightly larger than those of Reylé et al. 
(2010), but the overall effect on our results is small considering the expected low number of 
these stars. Results of Pirzkal et al. (2005) also tentatively pointed in the direction of a 
deficit in the late-type halo subdwarf population. They showed that the late M-dwarf 
population could not be reproduced with their assumed halo to disk ratio (0.25%) and axial ratio of 
the halo (0.7). While we did not discuss the bulge, Sumi et al. (2011) showed that the number 
density of M-dwarfs in the bulge is largely similar to the mass function from Chabrier (2003). 
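
The first method amounts to rescaling the thick-disk and halo M-dwarf contributions until the predicted counts match the detected ones. The source does not state the fitting statistic, so the following is only a sketch under an assumed chi-square criterion with a brute-force grid; all names are hypothetical, and `fixed` stands for the part of the prediction that is not rescaled (thin disk and non-M-dwarf stars).

```python
def chi_square(observed, sigma, fixed, thick, halo, s_thick, s_halo):
    """Chi-square between observed counts and rescaled model counts per bin."""
    total = 0.0
    for o, e, n0, n2, nh in zip(observed, sigma, fixed, thick, halo):
        model = n0 + s_thick * n2 + s_halo * nh
        total += ((o - model) / e) ** 2
    return total


def fit_scale_factors(observed, sigma, fixed, thick, halo, step=0.01):
    """Grid search over s_thick and s_halo in [0, 1].

    Returns (chi-square, s_thick, s_halo) at the grid minimum.
    """
    best = None
    n = int(round(1.0 / step))
    for i in range(n + 1):
        for j in range(n + 1):
            s_thick, s_halo = i * step, j * step
            c = chi_square(observed, sigma, fixed, thick, halo, s_thick, s_halo)
            if best is None or c < best[0]:
                best = (c, s_thick, s_halo)
    return best
```

The per-bin arrays would come from evaluating the model separately for each Galactic component; uncertainties on the best-fit factors would require an additional step (for example, mapping the chi-square contours), which is not shown here.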

Secondly, we varied the halo and thick disk fractions ($f_h$ and $f_d$). When smaller fractions 
were employed, the number of predicted stars was likewise reduced. Considering that most stars 
of 16.5-24.5 mag were M-dwarfs, employing overall smaller halo and thick disk fractions gave 
almost the same result as reducing the number density of M-dwarfs in the halo and thick 
disk. However, even if we use the minimum fractions for the halo and thick disk within the 
uncertainty given by Chang et al. (2011), the discrepancy between the predicted and detected 
numbers remains. Even with the fractions at the lower bound of that uncertainty, we still need 
to reduce the number density of M-dwarfs to 58% and 64% 
of the original value for the thick disk and halo, respectively. 

Fig. 7. Observed and predicted numbers of stars, using the best-fit number densities of M-dwarfs in the 
halo and the thick disk (see Subsection 4.3). The gray dot-dashed line represents the predicted number. 
Other legends and the field size are the same as in Figure 5(a). 

Finally, we changed the scale height and length of the thick disk using the standard 
number density (Case 1). The scale height of the thick disk was reported in many studies 
(including Chang et al. 2011; Polido et al. 2013; Carollo et al. 2010 and the references therein), 
based on star counts and stellar kinematics of the Galaxy. The scale height varies between 500 
and 1600 pc in these studies. Carollo et al. (2010) showed that the scale height of the thick disk 
is 990 ± 40 pc using stellar kinematics of the Galaxy. Most of the previous studies based on star 
counts (see references in Chang et al. 2011) concluded that the typical scale height is about 
1000 pc. Some studies, however, determined a smaller scale height. Polido et al. (2013), for 
example, stated that the thick disk’s scale height is 640 ± 70 pc based on 2MASS star counts in 
the J, H, and $K_s$ bands. We found that the predicted number was consistent with the detected 
number when the disk parameters from Polido et al. (2013) were employed. In addition, several 
works that focused only on the late-type dwarf distribution (e.g. Pirzkal et al. 2009; Ryan et al. 
2011; Holwerda et al. 2014) estimated the scale height of the thin disk based on star counts 
using a single disk model. They all concluded that the scale height of early M-dwarfs is larger 
than that of dwarfs later than M5. These results indicate that the thick disk component of 
late-type dwarfs cannot be clearly distinguished from the thin disk component. Similarly, Bovy, 
Rix, and Hogg (2012) concluded that there is no thick disk in the Galaxy, which is equivalent 
to employing a small scale height of the thick disk. Moreover, there is some variation in the 
scale length of the disks in the published literature (e.g. Bensby et al. 2011). We varied the 
scale length of the thick disk from 2000 to 5000 pc; however, the predicted number was 
not considerably affected. 
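
A rough worked example shows why a smaller thick-disk scale height acts much like a lower M-dwarf density at the distances probed here. Assuming a purely exponential vertical profile and ignoring the radial term and the small solar offset, the ratio of thick-disk densities at height $Z$ for a scale height of 600 pc relative to the 1040 pc of Table 1 is:

```latex
\frac{\exp(-Z / 600\,\mathrm{pc})}{\exp(-Z / 1040\,\mathrm{pc})}
  = \exp\left[-Z \left(\frac{1}{600\,\mathrm{pc}} - \frac{1}{1040\,\mathrm{pc}}\right)\right]
  \approx
  \begin{cases}
    0.49 & (Z = 1\,\mathrm{kpc}) \\
    0.24 & (Z = 2\,\mathrm{kpc})
  \end{cases}
```

Under these simplifying assumptions, the suppression near 1 kpc is comparable to the 52% density factor found with the first method, although the two modifications diverge at larger heights.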

In summary, the star count at H > 20.5 mag predicted with standard parameters, taken from 
models constructed from data with relatively shallow limiting magnitudes (e.g. Chang et al. 
2012), was not consistent with the detected count. Such models tend to overpredict the number 
of distant faint stars. The model matches the observations when we reduce the M-dwarf number 
density for the halo and thick disk components to 20 ± 13% and 52 ± 13%, respectively. 
Alternatively, the overall thick disk fraction (ff) and halo fraction (fh) can be lowered. 
Reducing the thick disk’s scale height to ~ 600 pc also made the model agree better with the 
data, but still required changing the M-dwarf density (or overall fraction fh) of the halo. The 
question of whether the fraction of M-dwarfs in the thick disk is smaller to begin with, or 
whether its scale height is lower for late types, ultimately cannot be resolved from the 
presented data. 
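
The difference between lowering the M-dwarf density and lowering the scale height can be illustrated with a simple double-exponential disk. The sketch below is not the model used in this work: the functional form, the 900 pc reference scale height, the 3500 pc scale length, and the solar Galactocentric radius of 8000 pc are assumptions chosen only for illustration.

```python
import math

def disk_density(R_pc, z_pc, norm, scale_length_pc, scale_height_pc, R_sun_pc=8000.0):
    """Relative number density of a double-exponential disk (assumed form).

    norm is the density relative to the solar vicinity (1.0 = unchanged).
    """
    radial = math.exp(-(R_pc - R_sun_pc) / scale_length_pc)
    vertical = math.exp(-abs(z_pc) / scale_height_pc)
    return norm * radial * vertical

# Compare two ways of thinning out the thick disk 2 kpc above the Sun.
z = 2000.0
standard = disk_density(8000.0, z, 1.0, 3500.0, 900.0)        # assumed reference case
fewer_m_dwarfs = disk_density(8000.0, z, 0.52, 3500.0, 900.0)  # density scaled to 52%
lower_height = disk_density(8000.0, z, 1.0, 3500.0, 600.0)     # scale height ~600 pc

ratio_density = fewer_m_dwarfs / standard   # 0.52 at every height
ratio_height = lower_height / standard      # depends on z
```

At a single height the two modifications give different factors (the scale-height change removes more stars the farther one looks from the plane). Star counts in a magnitude bin mix many distances along the line of sight, which is one reason the two options can be hard to separate from counts alone.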

5. Application to the Extrasolar Planet Survey Data 

In Section 4, the model parameters were fit to the deep NIR survey results. Using the 
best-fit number density, we calculated the predicted number of non-companion stars in 
the fields of an extrasolar planet survey, the SEEDS (Strategic Exploration of Exoplanets 
and Disks with Subaru) project. 

5.1. The SEEDS Data 

In the SEEDS project, extrasolar planets and protoplanetary disks were observed using 
direct imaging to elucidate universal rules of planetary formation (Tamura 2009). Observations 
were conducted using the Subaru telescope together with the high-contrast imaging instrument 
HiCIAO (Suzuki et al. 2010) and the adaptive optics system AO188 (Hayano et al. 2010; Hodapp 
et al. 2008). The FoV of a single observation is 19."5 x 19."5. From observations conducted 
between October 2009 and April 2013, 62 stars were selected on the basis of the five criteria 
listed below. Figure 8 shows the Galactic coordinates of the selected 62 stars. The total field 
area is 6.5 square arcminutes. 

(I) Stars with the Galactic coordinates (longitude: l, latitude: b) in the following range: 

20° < |l| ∩ 10° < |b| < 40°. 

This requirement excludes stars in the bulge and the Galactic plane subject to heavy 
extinction. The region of |b| > 40° was omitted due to the small sample size. This omission 
has little effect on the final results. 

(II) Stars observed in the H band. 

For the SEEDS project, most observations were conducted in the H band, and therefore, 
only H-band data were used. 

(III) Stars observed using the direct imaging + angular differential imaging (DI+ADI: Marois 
et al. 2006) mode. 

The DI+ADI mode is very effective for detecting point sources. We excluded more than 
50 stars which were not observed in this mode. 

(IV) Sufficient exposure time. 

The detection limit improves with longer exposure times. We selected only deep observations 
whose exposure time was longer than 15 minutes. 

(V) Data is available for analysis. 

Despite satisfying requirements (I) through (IV), image data was not available for about 
40 targets and could not be included in this study. 

Fig. 8. Galactic coordinates of 62 stars observed in the SEEDS project. We excluded stars near the 
Galactic plane and/or the Galactic center (GC). 
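
The five criteria and the quoted total area can be written down compactly. The following is a minimal sketch, not the selection code used for SEEDS: the `Target` record and its field names are hypothetical, and the longitude is assumed to be expressed in the range (-180°, 180°] so that |l| is meaningful.

```python
from dataclasses import dataclass

@dataclass
class Target:
    # Hypothetical record; field names are assumptions, not SEEDS metadata.
    l_deg: float           # Galactic longitude, assumed in (-180, 180]
    b_deg: float           # Galactic latitude
    band: str              # filter used for the observation
    mode: str              # observing mode
    exposure_min: float    # total exposure time in minutes
    data_available: bool   # image data accessible for analysis

def passes_selection(t: Target) -> bool:
    """Apply criteria (I) through (V) to one target."""
    in_region = abs(t.l_deg) > 20.0 and 10.0 < abs(t.b_deg) < 40.0  # (I)
    return (in_region
            and t.band == "H"            # (II)
            and t.mode == "DI+ADI"       # (III)
            and t.exposure_min > 15.0    # (IV)
            and t.data_available)        # (V)

def total_area_arcmin2(n_fields: int, fov_arcsec: float = 19.5) -> float:
    """Total sky area of n square fields, in square arcminutes."""
    return n_fields * fov_arcsec ** 2 / 3600.0

area = total_area_arcmin2(62)  # 62 fields of 19.5 x 19.5 arcsec each
```

With 62 fields of 19."5 on a side, the area evaluates to about 6.55 square arcminutes, in line with the 6.5 square arcminutes quoted above.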

Whether point sources detected near primary stars are companions (binaries or planets) 
was determined by measuring their proper motions. While it would be preferable to use only 
confirmed unbound stars, we also included stars whose binary status could not be confirmed, 
to extend the sky coverage and increase the sample size. This is justified by the large 
separation and faintness of many point sources lacking proper motion confirmation, which 
render the a priori probability of these objects being part of the primary system very small. 
The analysis method is shown in Appendix 1. We then calculated the number of non-companion 
stars using the best-fit number density. Figure 9 shows the comparison between the observed 
number and the predicted number in the whole SEEDS field, with an uncertainty given by the 
1σ Poisson scatter. For reference, we also show the predicted number using the standard case 
(Case 1) in Figure 9. 
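
A comparison of this kind can be sketched as follows. This is only an illustration under stated assumptions: the scatter is approximated as the square root of the predicted count (a Gaussian approximation that is rough for small counts), and the counts in the example are made up, not values from Figure 9.

```python
import math

def poisson_deviation(observed: int, predicted: float) -> float:
    """Deviation of an observed count from the prediction, in units of sqrt(predicted)."""
    if predicted <= 0:
        raise ValueError("predicted count must be positive")
    return (observed - predicted) / math.sqrt(predicted)

def is_consistent(observed: int, predicted: float, n_sigma: float = 1.0) -> bool:
    """True if the observed count lies within n_sigma of the prediction."""
    return abs(poisson_deviation(observed, predicted)) <= n_sigma

# Hypothetical bin: 13 stars detected where the model predicts 9.
deviation = poisson_deviation(13, 9.0)
within_1_sigma = is_consistent(13, 9.0)
within_3_sigma = is_consistent(13, 9.0, n_sigma=3.0)
```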

5.2. Discussion on the SEEDS field 

Focusing on the result of the best-fit model, the detected number is consistent with 
the predicted number at 18.5-19.5 mag and fainter than 20.5 mag when we include the stars 
awaiting follow-up observations (striped bins in Figure 9), but is larger in the other bins. 
Because the stars shown as striped bins are unlikely to be companions and there are few halo 
stars in the SEEDS field, the number density in the thin and thick disks could be increased to 
match the detected number. However, increasing the number density in the thin disk leads to a 
stronger deviation from the result of the GOODS fields. This discrepancy could be explained 
by the fact that the SEEDS field is about ten times smaller than each GOODS field. The model 
result is still within the 3σ uncertainty of the data. We then discuss the difference between 
the best-fit case and the standard case (Case 1). The predicted number in the standard case is 
larger for all bins, and might be more consistent with the detected number for magnitudes 
brighter than 20.5 mag compared to the best-fit case. However, the best-fit case is better 
suited to reproducing the number of fainter stars (H > 20.5 mag), because we constructed it to 
match the number of detected faint stars in the GOODS fields. We thus propose employing the 
model using the best-fit number density, which includes a smaller M-dwarf density in the halo 
and thick disk, when estimating the number of faint contaminating field stars in extrasolar 
planet surveys. 

Fig. 9. Observed and predicted number of stars in the SEEDS field, using the best-fit number density 
(dotted-dashed gray line) and the standard case (Case 1; solid black line). The whole SEEDS field (6.5 
square arcminutes) is shown. The detected and predicted numbers are represented by the bar graph and 
the line graph, respectively. Striped bins show stars whose binary status has not been confirmed yet. 

We further discuss the discrepancy between the detected and the predicted number, 
especially with respect to stars that have unconfirmed companion status. There is a high 
probability that most of these stars are unbound stars. Yamamoto et al. (2013) showed that 
the fraction of stars with giant planets at larger separation is less than 17.9%, and an equal 
or lower fraction was estimated in previous studies (including the references in Yamamoto et 
al. 2013). While we cannot rule out contamination by external galaxies completely due to the 
lack of multi-color observations in the SEEDS project, galaxies can still be excluded based on 
the FWHM of their PSF (point spread function) in most cases. The true number of detected stars 
would be smaller than the number shown in Figure 9 if any galaxies still remain in the sample of 
detected stars. Other effects, such as interstellar extinction, stars of the bulge component, and 
contamination by giants, can be neglected due to the selection and the magnitude range of the 
sample (see Subsection 5.1 and Figure 1). Some stars in the SEEDS field belong to a cluster (e.g. 
the Pleiades open cluster). Therefore, low-mass stars in the cluster may have been included. 
We estimated the number of low-mass stars from the mass function of the Pleiades 
(e.g. Bihain et al. 2006), but the number is too small to explain the disagreement between 
the detected and predicted numbers. Because the discrepancy cannot be fully explained by the 
aforementioned causes, we attribute it to the small sample size. 

6. Summary 

We tested the predictions of a stellar distribution model against deep NIR imaging 
in order to estimate the number of unbound stars in extrasolar planet surveys using direct 
imaging. We used data from regions in the GOODS-North (MODS region) and GOODS-South 
fields (ERS and CANDELS regions). The detected number of stars was compared with the 
number predicted by the standard stellar distribution model of the Galaxy. These numbers 
begin to differ significantly for H > 20.5 mag. The number density of M-dwarfs in the thick 
disk or the scale height of the thick disk has to be decreased to match the detected 
number. In addition, M-dwarfs have to be much less abundant in the halo component than 
expected from the number of M-dwarfs in the solar vicinity. Alternatively, significantly 
reducing the overall halo and thick disk fractions yields similar results. The degeneracies in the 
model parameters (e.g., reducing scale height or component fraction) cannot ultimately be 
resolved with deep IR data alone. Regardless of the exact realization chosen, it is clear that 
previous models based on shallower data (i.e. SDSS or 2MASS) overpredict the number 
of M-dwarfs in the regions distant from the Galactic plane. This has implications for the 
prediction of the number of contaminating faint sources in direct imaging surveys for extrasolar 
planets. We consequently applied the model using our best-fit number density to the data of an 
actual extrasolar planet survey, the SEEDS data. The detected number of unbound stars shows 
good agreement with the prediction, but it is larger than expected in two of the magnitude bins 
(16.5-17.5 and 19.5-20.5). The small sample size might cause this discrepancy. 
We recommend employing the model using the best-fit number density, which has a smaller 
M-dwarf density in the halo and thick disk, when estimating the number of non-companion 
faint stars. 

We thank R. Kooistra for spending the time to carefully proof-read our manuscript. We 
also thank the referee for all the insightful comments that helped improve this paper. 

Appendix 1. Analysis of the SEEDS data 

The samples were analyzed in accordance with the following steps. 

(A) ADI analysis: 

The purpose of the ADI analysis is to improve the contrast between the primary star and 
its surroundings by reducing the quasi-static speckle noise resulting from deep exposures. 
Many methods for the analysis of ADI data have been proposed. In this study, we only 
considered the region farther than 1."5 away from the primary star to neglect the influence 
of the primary star’s halo. Standard ADI analysis was employed as it provides the best 
signal-to-noise ratio in the region farther from the primary star (Konishi et al. 2013). LOCI 
(Locally Optimized Combination of Images; Lafrenière et al. 2007) analysis was not used 
because this method is mainly used to probe the closest vicinity of stars. The analysis was 
implemented in IRAF (Image Reduction and Analysis Facility). 

(B) Photometry of the primary star: 

The star’s magnitudes and the detection limits were calculated using photometry of the 
primary star. In cases where images of the primary star were saturated, we used the 
unsaturated data taken with shorter exposure times and with a neutral 
density filter before and after the saturated images. Aperture photometry was performed 
with the APPHOT package of IRAF. The FWHM size of the stars was at most 8 pixels. 
Aperture radii of both 20 and 40 pixels were employed, and for the sky area we adopted an 
annulus from 40 to 100 pixels centered on the star. The magnitude of the primary star 
was taken from the SIMBAD catalogue 4 . 
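
The aperture-plus-annulus measurement described in this step can be sketched in a few lines. This is a simplified stand-in, not the algorithm of IRAF's APPHOT: pixels are counted whole (no fractional-pixel weighting), the sky level is taken as the median of the annulus, and the tiny test image with its radii is made up for illustration.

```python
import math
from statistics import median

def aperture_flux(image, x0, y0, r_ap, r_in, r_out):
    """Sky-subtracted flux in a circular aperture (whole-pixel approximation).

    image is a list of rows; (x0, y0) is the star position in pixels.
    The sky level is the median of pixels with r_in <= r <= r_out.
    """
    ap_sum, ap_npix, sky = 0.0, 0, []
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            r = math.hypot(x - x0, y - y0)
            if r <= r_ap:
                ap_sum += value
                ap_npix += 1
            elif r_in <= r <= r_out:
                sky.append(value)
    if not sky:
        raise ValueError("sky annulus contains no pixels")
    return ap_sum - median(sky) * ap_npix

# Toy example: flat sky of 5 counts with a 100-count source in the central pixel.
image = [[5.0] * 21 for _ in range(21)]
image[10][10] += 100.0
flux = aperture_flux(image, 10, 10, r_ap=3, r_in=5, r_out=9)
```

For the radii quoted in the text, the call would use `r_ap=20` or `r_ap=40` with `r_in=40` and `r_out=100`.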

(C) Evaluation of self-subtraction: 

The flux of point sources is reduced by self-subtraction in ADI analysis. Self-subtraction 
generally increases when point sources are near the primary star. However, in this study, 
self-subtraction is approximately constant because only the area farther than 1."5 from the 
primary star was used. We estimated the amount of self-subtraction using the following 
method: Artificial stars (6 - 10 stars per image) were randomly embedded in the images, 
avoiding the image edges and the vicinity of the primary star. We then analyzed the data 
following the procedure described in (A) and (B) and repeated this process until the number 
of embedded stars exceeded 100 (ten or more iterations). We then averaged the 
self-subtraction over all data. In the area closest to the primary star (> 1."5) the amount 
of self-subtraction was always below 10%. 
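
The bookkeeping behind this estimate can be sketched as follows. It is an illustration only: the injected and recovered fluxes would come from the embedding and ADI steps, which are not reproduced here, and the function names and the 8% loss in the example are hypothetical.

```python
def self_subtraction(injected_flux: float, recovered_flux: float) -> float:
    """Fraction of the injected flux lost in the ADI processing."""
    return 1.0 - recovered_flux / injected_flux

def mean_self_subtraction(trials, min_stars: int = 100) -> float:
    """Average flux loss over artificial stars.

    trials is a sequence of (injected_flux, recovered_flux) pairs and must
    contain more than min_stars entries, as in the procedure described above.
    """
    losses = [self_subtraction(inj, rec) for inj, rec in trials]
    if len(losses) <= min_stars:
        raise ValueError("not enough embedded stars for a stable average")
    return sum(losses) / len(losses)

def correct_flux(measured_flux: float, mean_loss: float) -> float:
    """Undo the average self-subtraction for a detected point source."""
    return measured_flux / (1.0 - mean_loss)

# Hypothetical run: 101 artificial stars, each losing 8% of its flux.
trials = [(1000.0, 920.0)] * 101
mean_loss = mean_self_subtraction(trials)
restored = correct_flux(920.0, mean_loss)
```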

(D) Calculation of the detection limit: 

We determined the 5σ detection limit by embedding stars of fixed magnitude and using the 
SExtractor tool (Bertin & Arnouts 1996) to extract the point sources from the noise. Ten 
stars were randomly embedded at the same time in the image after ADI analysis; the 
number of stars extracted at the 5σ level was then measured. This process was repeated 10 
times and the overall fraction of detected point sources was then calculated. Altering the 
magnitude of the artificial stars in steps of 0.1 mag, this process was repeated until the 
detected fraction dropped below 50% after correcting for self-subtraction. The derived 
detection limits were between 17.8 and 22.1 mag, with a median of 20.5 mag. 

4 http://simbad.u-strasbg.fr/simbad/sim-fbasic 
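
The stepping procedure of step (D) can be sketched as a simple loop. This is a schematic under stated assumptions: `recover` is a hypothetical callable standing in for "embed one artificial star of the given magnitude and check whether the extraction finds it at 5σ", and the self-subtraction correction is not modelled.

```python
def detected_fraction(recover, mag, n_stars=10, n_repeats=10):
    """Fraction of embedded artificial stars of magnitude mag that are recovered."""
    n_total = n_stars * n_repeats
    hits = sum(bool(recover(mag)) for _ in range(n_total))
    return hits / n_total

def detection_limit(recover, start_mag, step=0.1, threshold=0.5, max_mag=30.0):
    """First magnitude at which the recovered fraction drops below threshold."""
    n = 0
    while start_mag + n * step <= max_mag:
        # Recompute from the start value to avoid accumulating rounding errors.
        mag = round(start_mag + n * step, 6)
        if detected_fraction(recover, mag) < threshold:
            return mag
        n += 1
    raise RuntimeError("detected fraction never dropped below the threshold")

# Toy recovery rule: every star brighter than 20.45 mag is found, none fainter.
limit = detection_limit(lambda mag: mag < 20.45, start_mag=18.0)
```

In practice the recovery would be stochastic, so the fraction falls off gradually rather than in one step as in this toy rule.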

(E) Investigating characteristics of detected stars: 

First, the magnitudes of detected stars were calculated. Photometry was conducted as 
described in procedure (B). We corrected for self-subtraction according to procedure (C). 
We then confirmed whether detected stars were companions or unbound stars based on 
results from previous studies (Brandt et al. 2013; Janson et al. 2013; Yamamoto et al. 
2013). The confirmation involved measuring the detected stars’ proper motions relative 
to the primary star. We used both unbound stars (background and foreground stars) as 
well as stars whose binary status could not be confirmed. 
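
The conversion from measured flux to magnitude in this step can be sketched as relative photometry against the primary star. This is an assumption-laden illustration rather than the exact calibration used here: both fluxes are assumed to be already scaled to the same exposure time and filter throughput, and the numbers in the example are invented.

```python
import math

def source_magnitude(primary_mag, primary_flux, source_flux, mean_self_subtraction=0.0):
    """Magnitude of a detected point source relative to the primary star.

    source_flux is first corrected for the average self-subtraction, then
    compared with the primary star's flux via the standard magnitude relation.
    """
    corrected = source_flux / (1.0 - mean_self_subtraction)
    return primary_mag - 2.5 * math.log10(corrected / primary_flux)

# Hypothetical case: an H = 8.0 mag primary and a source 100000 times fainter.
mag_uncorrected = source_magnitude(8.0, 1.0e6, 10.0)
mag_corrected = source_magnitude(8.0, 1.0e6, 10.0, mean_self_subtraction=0.1)
```

Correcting for self-subtraction restores lost flux, so the corrected magnitude is always slightly brighter than the uncorrected one.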

Finally, the mean number of unbound stars was calculated for every field using the 
optimized stellar distribution model. The detection limit of each field was taken into account 
when the predicted number was calculated. Because the signal-to-noise ratios were insufficient 
near the primary star and the edges of images, we removed these regions (20% of all fields). 
The predicted number in the SEEDS observations was the sum of the predicted number in each 
field. 
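
The final summation can be sketched as below. This is a schematic only: `surface_density` is a hypothetical callable standing in for the stellar distribution model, and the 20% removal is interpreted here as removing 20% of each field's area, which is an assumption.

```python
def predicted_total(fields, surface_density, usable_fraction=0.8, fov_arcsec=19.5):
    """Sum the predicted number of unbound stars over all fields.

    fields is a sequence of (l_deg, b_deg, limiting_mag) tuples.
    surface_density(l_deg, b_deg, limiting_mag) returns the model's number of
    stars per square arcminute brighter than the field's detection limit.
    """
    usable_area = usable_fraction * fov_arcsec ** 2 / 3600.0  # square arcminutes
    return sum(surface_density(l, b, lim) * usable_area for l, b, lim in fields)

# Toy model: one star per square arcminute regardless of direction or depth.
fields = [(45.0, 20.0, 20.5), (-60.0, -30.0, 19.8)]
total = predicted_total(fields, lambda l, b, lim: 1.0)
```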